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Abstract 

A preferential attachment model for a growing network incorporating deletion of edges 
is studied and the expected asymptotic degree distribution is analyzed. At each time 
step t — 1 , 2 ,..., with probability tti > 0 a new vertex with one edge attached to 
it is added to the network and the edge is connected to an existing vertex chosen 
proportionally to its degree, with probability 7r2 a vertex is chosen proportionally to 
its degree and an edge is added between this vertex and a randomly chosen other 
vertex, and with probability 713 = 1 — tti — 7r2 < 1/2 a vertex is chosen proportionally 
to its degree and a random edge of this vertex is deleted. The model is intended 
to capture a situation where high-degree vertices are more dynamic than low-degree 
vertices in the sense that their connections tend to be changing. A recursion formula is 
derived for the expected asymptotic fraction pk of vertices with degree k, and solving 
this recursion reveals that, for tts < 1/3, we have pk ^ /j-(3-77r3)/(i-37r3)^ while, for 
TTs > 1/3, the fraction pfc decays exponentially at rate (tti -|-7r2)/27r3. There is hence a 
non-trivial upper bound for how much deletion the network can incorporate without 
loosing the power-law behavior of the degree distribution. The analytical results are 
supported by simulations. 

Keywords: Preferential attachment, preferential deletion, complex networks, random 
graphs, degree distribution. 
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1 Introduction 

During the last decade there have been tremendous advances in the study of large complex 
networks; see for surveys. Empirical observations on real networks such as social 

contact structures, citation networks and the Internet/World Wide Web have revealed that 
many networks share similar properties, in particular, they typically have power-law degree 
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sequences, that is, the number of vertices with degree k decays as an inverse power of k] 
see e.g. [I71[I91|26]. One way of modeling networks is by using random graphs, and large 
efforts have been made to formulate graph models that can capture the properties of real 
networks. The classical Erdos-Renyi graph, see [16], is discarded in this context because of 
its much too simplified structure. A specific point where it fails to capture the empirical 
observations is the degree distribution (which is a Poisson distribution in the limit in 
the Erdos-Renyi graph). To obtain a richer structure, the Erdds-Renyi model has been 
generalized in various ways, for instance by making the edge probabilities inhomogeneous, 
see e.g. [5lE|. Other model types that have been studied include the configuration model, 
see Ell ESI, and certain bipartite graphs, see [ini[25]. 

The models mentioned above are all static in the sense that the number of vertices is 
taken to be constant (albeit large). In reality however many networks are dynamic: the 
number of vertices increases as new vertices arrive in the network and decays as vertices 
leave the network. A framework for growing networks, based on the idea that new vertices 
tend to connect to high-degree vertices rather than low-degree vertices, is provided by 
preferential attachment models, introduced in the context of complex network modeling 
by Barabasi and Albert |2| 1999 to describe the World Wide Web. Barabasi and Albert 

observed that such a mechanism leads to power-law degree sequences, and their findings 
have later been confirmed and elaborated on in Ej. The model type has received a lot of 
attention, see e.g. [SlIIllEo] for surveys, and the growth mechanism has been generalized 
in various ways, see e.g. |8] for a very general version of the model. 

There is considerably less work on preferential attachment based models where edges 
and/or vertices also can be removed. In [12] a model is studied where edges between 
existing vertices are added or removed each time a vertex is added. Addition and deletion 
can however not be incorporated simultaneously in this model. Later, Cooper et al. [9] 
and Chung and Lu [7] have studied models where vertices and edges are deleted randomly, 
and shown that this leads to power-law degree sequences with exponents that depend on 
the addition/removal probabilities. Eurthermore, Moore et al. |23j and Cami and Deo [11] 
analyze models where vertices are deleted. 

In this paper we consider a model where edges can be deleted and new edges created 
between existing vertices, the deletion and addition of edges occurring at vertices chosen 
preferentially proportionally to their degree. The edge deletion/addition is hence done 
preferentially with a vertex focus: we pick the vertex to lose/gain an edge. This is intended 
to describe a scenario where high-degree vertices tend to have more dynamic connections as 
compared to low-degree vertices, that is, the connections of high-degree vertices are more 
prone to break, but high-degree vertices are also more likely to create new connections to 
existing vertices. It is not hard to come up with real networks where this is likely to be 
the case. In a social network for instance, very social individuals with a large number of 
contacts may have weaker bonds to their acquaintances as compared to individuals with 
very few contacts. On the other hand, the fact that social individuals have many contacts 
makes it more likely for them to come in contact with other people in the population and 
create new bonds as well. 

In order for the model to define a growing network, the edge removal probability, 
denoted by tts, is assumed to be smaller than 1/2. Interestingly, it turns out that the model 
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exhibits a transition at tts = 1/3: for tts < 1/3 the expected degree distribution obeys a 
power-law with an exponent that diverges as vra approaches 1/3, while, for tts > 1/3, the 
degree distribution decays exponentially. There is hence a non-trivial limit for the removal 
probability above which the power-law behavior of the degrees is lost. Furthermore, the 
exponent in the power-law regime of the model turns out to depend on the probability 
TTi of adding a vertex and the probability 1^2 of adding an edge between existing vertices 
only through their sum, that is, the relation between the vertex and the edge addition 
probabilities does not affect the tail behavior of the degree distribution (although it does 
affect the mean degree of course). 

2 The model 

The model that we will consider is described by a graph process {G(t)}t>i consisting of 
simple graphs, that is, graphs without self-loops and multiple edges. Write Vt and Et for 
the set of vertices and edges, respectively, in G{t) and let vt = \Vt\ and et = \Et\. The 
graph G(l) is taken to be a graph consisting of two vertices connected by an edge and, for 
t > 1, the graph G{t -|- 1) is constructed from G{t) in that one of the following three steps 
is carried out: 

1. With probability tti > 0, a vertex with one edge attached to it is added to the 
graph. The edge is connected to an existing vertex w with probability proportional 
to degree, that is, 

P(tc = u) = for u € Vt, 

^ ^ 2et 

where dt{u) denotes the degree of vertex u in G{t). 

2. With probability 7r2, an edge is added between a vertex chosen proportionally to its 
degree and another vertex chosen uniformly at random. If this results in a multiple 
edge between two vertices, the edges are merged. 

3. With probability tts = 1 — tti — 7r2, an edge is deleted from the graph in that a vertex 
is chosen proportionally to its degree and an edge chosen uniformly at random from 
the set of all edges incident to this vertex is removed. 

When et = 0, modifications of the rules are needed: for a step of type 1, the edge is 
connected to a randomly chosen vertex, for a step of type 2, an edge is added between two 
randomly chosen vertices and, for a step of type 3, the graph is left unchanged. However, 
throughout, we will assume that < 1/2 so that the number of edges in the graph is 
indeed growing (on average). This means that a graph without any edges occurs very 
rarely and how we deal with this case should not matter for the results derived here. 
Indeed, the graph G{t) can have et = 0 only if at least \t/2] steps have been of type 3 
and, for < 1/2, the probability of this decays exponentially in t. We can hence disregard 
this possibility in the below derivations. 

Note that, when 7r2 = 0, a vertex that ends up having degree 0 at some point will, with 
a probability tending to 1, keep on having degree 0 forever. When 7r2 > 0 on the other 
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hand, a vertex with degree 0 can acquire new edges (by being selected as an endpoint of 
an edge added as a result of step 2). Also note that the step 3 is equivalent to deleting an 
edge uniformly at random from the set of edges: First a vertex u is chosen with probability 
dt{u)j2et and then an edge incident with u is chosen randomly, that is, each incident edge 
has probability l/dt{u) of being chosen. The probability that a given edge in the graph is 
chosen via any of its two endpoints is hence l/ct- 

3 Expected asymptotic degree distribution 

Write Nk{t) for the number of vertices with degree k in G{t) and let IE[-] denote mathemat¬ 
ical expectation. Assume that K[Nk{t)]/t converges to some limit as t —)■ oo. In view 
of previous work on related models, e.g. [8] and [9], this assumption is indeed reasonable. 
It can presumably be confirmed rigorously by a more careful analysis of ([T]) along similar 
lines as in [91^1 . We will analyze the behavior of pk as k ^ oo. 

3.1 A recursion formula for pk 

Given G{t), the average change in the {t + l):th step in the number of vertices of degree 
k (A: = 0,1,...) is given by 

+ 1) - Nkit)\G{t)] = TTiCg + 7r2Cg + TTgCg + n,l{k=i}, (1) 

where G^l denotes the average change if the step is of type i {i = 1,2,3). Defining 
A_i(t) := 0 for all t, we have 

(fc-l)iVfc_i(t) kNkjt) 

2et 2et 

{k - l)jVfc_i(f) jVfc-i(t) _ kNkjt) _ Nkjt) 

2et vt 2et vt 

2{k + l)Nk+i{t) _ 2kNk{t) 

2et 2et 

(3) 

Here, in deriving the expression for G^ j., we have used the fact that the fraction of neigh¬ 
bors with degree k of the vertex chosen in step 3 is given by 

dt{u) Nk{t;u) 1 kNk{t) 

ueVt ueVt 

where Ni^{t;u) is the number of neighbors with degree k of the vertex u at time t (also 
recall that dt{u) denotes the degree of u at time t). To obtain a recursion for p^, we take 

'^We have that 'E[Nk{t)]/t satisfies the recursion obtained from ([T} conditional on that the number 
of vertices and edges respectively in the graph are concentrated around their means. With sufficient 
concentration bounds at hand it would then follow that the solution of the recursion, which we denote by 
Pk, is a good approximation of ¥,[Nk{t)]/t for large t so that, in particular, ¥.[Nk{t)]/t —>■ c.f. [51 Lemma 
5.2]. 


_ 

^( 2 ) _ 


py(3) _ 

'-'k.t ~ 
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expectation in ([T|) and let t ^ oo. Ignoring the effects of merging multiple edges in step 
2 and the modifications of the growth rule when e* = 0 — indeed, both these phenomena 
are negligible in the limit of large t — we have that is a sum of t independent identically 
distributed variables with mean vri + 7r2 — tts. This means that E[et]/t —>• tti + 7r2 — tts and 
etjt —>• vTi + 7r2 — vTa in probability (that is, P(|^ + t ^2 — tts)! > e) —>■ 0 for any e > 0). 

Furthermore, we have that AIfc(t)/2ej < 1/k. Hence, assuming that the convergence of 
Nk(t)/t to pk holds also in probability, it follows from bounded convergence (see e.g. [T4] i 
that 


E 


Nkjt) 

2et 


Pk 

2(7ri + 7r2 — TTs) 


( 2 ) 


The other terms of the same form are dealt with analogously. Similarly K[vt]/t —tti 
and vt/t TTi in probability, and hence E,[Nk{t)/vt] —>■ Pul'll- Also note that, since 
IE[A^fc(i)] ~ tpk for large t, we have E[Nk{t + 1) - A^A:(i)] ~ (t + l)Pk - tPk = Pk- Merging 
terms together and defining p_i := 0, this leads to the recursion 


[a2(fe+2)+/32bfc+2+[ai(^+l)+/3i]Pfc+i+[aofc+/3o]Pfc = 7ri(7ri+7r2-7r3)l{fc=o} for ^ > “I; 

(3) 

where 

a2 = 27ri7r3 /32 = 0 

ai = -7ri(7ri + 7r2 + 27r3) /3i = -2(7ri + 7r2)(7ri + 7r2 - tts) 

ao = 7ri(7ri + 7r2) /3o = 27r2(7ri + 7r2 - tts). 

To get a first idea of the asymptotic behavior of the solution to ([3]) , divide both sides 
by {k + l)pk+i and take the limit as /c —>■ oo. Writing \im.k^aoPk+i/Pk = x, this yields the 
equation a 2 X^ + aix + oq = 0, which is solved by x = 1 and x = ao/<a 2 = (tti + 7r2)/27r3. 
Here x = 1 corresponds to a power-law tail, while, for tti -|- 7r2 < 27r3, the latter solution 
corresponds to exponential decay at rate (tti -|- 7r2)/27r3. We now analyze the solution of 
Q in more detail. 


3.2 Solving the recursion 

The recursion formula ([3]) is of second order and hence, imposing the two conditions 
= 1 and p-i = 0 yields a unique solution. We will use the method of Laplace, see 
e.g. [IHl p.579], to solve the homogeneous equation 

[a 2 {k + 2) + P 2 ]Pk +2 + [oti{k + 1) + I5i]pk+i + [aok + l5o\pk = ^ for/c > 1. (4) 

The solution space of this equation is two-dimensional but typically only one of the solu¬ 
tions is convergent. It can then be seen along the same lines as in [9l Lemma 6.2] that, 
if Pk is a convergent solution of dH, then there is a constant c > 0 such that cpk solves 
the full recursion ([3]) under the conditions YlPk = ^ and p-i = 0. To analyze the tail 
behavior of the solution we can hence restrict attention to the solution of the homogeneous 
equation. 
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Following the method of Laplace, we substitute 

rb 


/ 


Pk= I ^h{t)dt, 


and note that integration by parts of ([S]) yields 


j-b 

kpk = - / t^h'{t)dt. 

J a 


( 5 ) 


( 6 ) 


Furthermore, define 


V'a(t) := a2t^ + ait + ao = 02(1 - t) 

:= + Pit + Pq = Pit + Pq. 

By substituting ([5|) and ([6]) into ([1]) it can be seen that the homogeneous equation ([1]) is 
satisfied if a, b and h{t) are chosen such that 


and 


The differential equation 


where 


[t^h{t)ip^{t)]’^^ = 0 

h'jt) ^ -0/3 (t) 
h{t) t^pait)' 

can be integrated and is solved by 

h{t) = t^^{i-ty^ (^ 

V«2 


_ _ 27r2(7ri+7r2 —ttb) 

Q;0 "TTi (7ri+7r2) 

^ _ /3o+/3i _ 2(7ri+7r2-7r3) 

a2—OLQ 


(7) 

( 8 ) 


7ri+7r2— 2-773 

_ /3oQ2+/3iQ:o _ 2(771+7r2-7r3)(27r27r3-(7ri+7r2)^) 

'3 ao{a2—ao) 771 (771+772) (771+772—2-773) 

Note that 71 + 72 + 73 = 0 and 71 > 0. 

The power-law regime. Assume that < 1/3, that is, 27r3 < tti + 7 r2, so that 
equivalently ao/c ^2 > 1- Then 72 > 0 and ([7]) is satisfied with a = 0 and 6=1. Using 
the notation x to denote that the left hand side is bounded from above and below by 
constants times the right hand side, and the notation ~ to denote that the quotient of the 
right hand side and the left hand side converges to a constant, we get that 

"1 / \ 73 


Pk = 


J -t 

^\fc+7i-l(i_^)72^i 

Jo 

r(fc + 7 i)F(i + 72 ) 

r(A: + 1 + 71+72) 

j(.-(l+72)^ 


dt 
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Here 


1+72 


3-77r3 
1 - Svrs ’ 


which is monotonically increasing on the interval tts € [0,1/3). For tts = 0 we get 1 + 72 = 
3, and the exponent then diverges as tts ^ 1/3. Figure [T] shows simulations of the model for 
two different sets of parameter values, both with tts < 1/3. As can be seen, the agreement 
with the analytical prediction is good. The reason for plotting the tail probabilities of the 
degree distribution is to reduce the noise. 


Exponential decay. Now assume that tts > 1/3, that is, 27r3 > tti + 7 r 2 so that hence 
«o /«2 < 1 and 72 < 0. Furthermore, assume that 73 > —1. Then ([7|) is satisfied with 
a = 0 and 6 = 00/02 and we get 


^-( 1 + 73 )^ 

that is, pk decays exponentially at rate ao /«2 = (tJ"! + 7r2)/27r3, with an extra power-law 
factor jf extra assumption that 73 > —1 is not satisfied the above described 

method will instead result in that can be expressed in terms of a certain contour 
integral. A detailed description of the method in the contour integral case together with 
the corresponding analysis is given in [H Section 4]. By using [ 8 l Eq. (24)] we can again 
ascertain that pk ~ ( 00 /^ 2 )^ large values of k. Figure [2| shows simulations of 

the model for two different parameter values with tts > 1/3. 



3.3 Special cases. 

Let ' 7 r 2 = 0 and write tti = p and tts = 1 — tti = ( 7 , with p > 1/2, that is, at each step either 
a vertex with one edge attached to it is added to the graph or an edge is removed. Then, for 
q < 1/3, we have pk ~ 7 c“( 3 - 7 q)/{i- 3 q) for q > 1/3, we have pk ~ {p/2q)^ (ignoring 

the power-law factor). This model is identical to a special case of a model treated in [9] 
when setting = q,ai = p and a = p in their notation. The paper [9] is only concerned 
with the power-law regime, but in that situation the results agree. Another special case 
is when 7 r 3 = 0 so that, at each time step, either a vertex with one edge attached to it 
is added to the graph or an edge is added. In this situation p^ ~ k~^ regardless of the 
values of tti and tt 2 . This model is a sub-model of the general model treated in [ 8 ] when 
the parameters of the model there are set according to a = 7 r 2 ,pi = (71 = 1,/3 = 0 , J = 0 
and 7 = 1 . Again our results agree with the existing ones. When 7 r 2 = tts = 0 we obtain 
the standard preferential attachment model with pk ~ k~^. 
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4 Summary and discussion 


We have analyzed a dynamically evolving graph where, in addition to the standard pref¬ 
erential attachment step, edges are also sent out and deleted from existing vertices that 
are chosen proportionally to their degree. High-degree vertices hence tend to experience 
a higher versatility in their connections as compared to low-degree vertices. Assuming 
that the expected number of vertices with degree k converges to a limit pk as time goes to 
infinity, we have analyzed the behavior of as A: —> oo. It turns out that pk decays expo¬ 
nentially if the removal probability exceeds 1/3, while it has a power-law tail for smaller 
values of the removal probability, the exponent going to infinity as the removal proba¬ 
bility approaches 1/3. Since real networks typically exhibit power-law degree sequences 
(with fairly small exponents) this indicates that these networks are the outcomes of edge 
addition/deletion processes where the addition rate well exceeds the deletion rate. 

As for future work, the results in the paper may be possible to make rigorous along 
similar lines as in [9]. Concentration results seem to be difficult to establish for prefer¬ 
ential attachment models incorporating deletion, but some recently observed connections 
between preferential attachment models and branching processes in continuous time might 
be useful in this context. It would indeed also be interesting to study other versions of 
preferential attachment models that incorporate deletion of edges. One option would be 
to assign a random life-time to each edge that is added to the network and remove the 
edge when this time expires, generalizing the work in m- One could also think of a rule 
where the probability of removing an edge is large just after the edge has been added, 
but then gradually decreases. This is to model a situation where the connections tend to 
stabilize — in a social network for instance, not all new acquaintances turn into long-term 
friends, but a relation that has lasted for a while may be more stable. 

Another line of investigation would be to look at dynamical features of the graph, such 
as for instance the evolution of the degree of a fixed vertex. As previously mentioned, the 
possibility for a vertex with degree zero of acquiring new edges depends on whether 7r2 is 
zero or positive. It has been pointed out to us by an anonymous referee that this could 
lead to interesting differences in the behavior of the asymptotic degree of a fixed vertex 
in the exponential regime. This would be worthwhile to investigate further. 
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Eigure 1: Simulation of the model in the power-law regime. The figure shows the tail 
probabilities of the degree distribution when tti = 0.3,7 r2 = 0.5 and vra = 0.2 (triangles), 
and vTi = 0.5,7r2 = 0.3 and vra = 0.2 (diamonds) together with the theoretical power-law 
tail (solid line). These choices of vr both give power-law exponent 4. Each of the simulated 
curves correspond to a single realization of the process with the number of time steps 
taken such that the expected number of nodes equals 30,000. 
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Figure 2: Simulation of the model in the exponential decay region. The figure shows 
the tail probabilities of the degree distribution when vri = 0.1,7r2 = 0.5 and tts = 0.4 
(triangles), and vri = 0.5,7r2 = 0.1 and tts = 0.4 (diamonds). Each of the simulated curves 
correspond to a single realization of the process with the number of time steps taken such 
that the expected number of nodes equals 30,000. 


11 



